Torpid Mixing of Local Markov Chains on 3-Colorings 

of the Discrete Torus 
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Abstract 

We study local Markov chains for sampling 3-colorings 
of the discrete torus T L , d = {0, ...,£ - l} d . We 
show that there is a constant p « .22 such that for 
all even L > 4 and d sufficiently large, certain local 
Markov chains require exponential time to converge to 
equilibrium. More precisely, if M. is a Markov chain on 
the set of proper 3-colorings of Tt t d that updates the 
color of at most pL d vertices at each step and whose 
stationary distribution is uniform, then the convergence 
to stationarity of A4 is exponential in L d ~ x . Our proof 
is based on a conductance argument that builds on 
sensitive new combinatorial enumeration techniques. 

1 Introduction 

Sampling and counting colorings of a graph are fun- 
damental problems in computer science and discrete 
mathematics. We consider the problem of sampling uni- 
formly at random from the set Ck = Ck(G) of proper 
fc-colorings of a graph G — (V,E). A proper fc-coloring 
X is a labeling \ '■ V ~* k such that all neighboring 
vertices have different colors. This sampling problem is 
also fundamental in statistical physics and corresponds 
to generating configurations from the Gibbs distribu- 
tion of the zero-temperature antiferromagnetic Potts 
model |20) . From the physics perspective, the under- 
lying graph is typically taken to be the cubic lattice Z d 
and sampling and counting reveal underlying thermody- 
namic properties of the corresponding physical system. 

Much focus has gone towards solving the sampling 
problem using rapidly mixing Markov chains. The 
idea is to design a Markov chain whose stationary 
distribution is uniform over the set of proper colorings. 
Then, starting at an arbitrary coloring and simulating 
a random walk according to this chain for a sufficient 
number of steps, we get a sample from close to the 



desired distribution. The number of steps required of 
this walk is referred to as the mixing time (see, e.g., 
|17j). The chain is called rapidly mixing if the mixing 
time is polynomial in n = \V\ (so it converges quickly 
to stationarity); it is torpidly mixing if its mixing time 
is super-polynomial in n (so it converges slowly) . There 
has been a long history of studying mixing times of 
various chains in the context of colorings (see, e.g., 

ID ©Ell ED 113 03]). 

A particular focus of this study has been on Glauber 
dynamics. For proper fc-colorings this is any single-site 
update Markov chain that connects two colorings only 
if they differ on at most a single vertex. The Metropolis 
chain Aik on state space Ck has transition probabilities 
Pk(XuX2), Xi,X2 GCfc, given by 



Pk{X\,X2) 



fO, if \{v€V: X i(v)^X2(v)}\ >1; 
i£\{v€V:xi(v)^ X 2(v)}\ = l; 

^-E Xl ^ 2 ec k Mxi,X2), 
if X.\ = X.2- 
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We may think of M.k dynamically as follows. From a 
fc-coloring \, choose a vertex v uniformly from V and 
a color j uniformly from {0, . . . , k — 1}. Then recolor v 
with color j if this is a proper fc-coloring; otherwise stay 
at x- 

When Mk is ergodic, its stationary distribution 7Tfc 
is uniform over proper fc-colorings. A series of recent 
papers have shown that Mk is rapidly mixing provided 
the number of colors is sufficiently large compared to 
the maximum degree (see [B] and the references therein). 
Substantially less is known when the number of colors 
is small. In fact, for k small it is NP-complete to decide 
whether a graph admits even one fc-coloring. 

In this paper we focus on the mixing rate of Aik 
on rectangular regions of the cubic lattice Z d . Observe 
that the lattice is bipartite, so it always admits a fc- 
coloring for any fc > 2. It is also known that Glauber 
dynamics connects the state space of fc-colorings on any 
such lattice region [15]. In Z 2 much is known about 



the mixing rate of Aik- Luby et al. [15] showed that 
Glauber dynamics for sampling 3-colorings is rapidly 
mixing on any finite, simply-connected subregion of Z 2 
when the colors on the boundary of the region are 
fixed. Goldberg et al. [TU] subsequently showed that 
the chain remains fast on rectangular regions without 
this boundary restriction. Substantially more is known 
when there are many colors: Jerrum [T2] showed that 
Glauber dynamics is rapidly mixing on any graph 
satisfying k > 2 A, where k is the number of colors 
and A is the maximum degree, thus showing Glauber 
dynamics is fast on Z 2 when k > 8. It has since been 
shown that it is fast for k > 6 [2 [3] . Surprisingly the 
efficiency remains unresolved for k = 4 or 5. 

In higher dimensions much less is known when k is 
small. Physicists have performed extensive numerical 
experiments [5j [19] suggesting that Glauber dynamics 
on 3-colorings is torpidly mixing when the dimension 
of the cubic lattice is large enough. We prove this 
conjecture for the first time here by studying the 
mixing time of the chain on cubic lattices with periodic 
boundary conditions. 

1.1 Results Our focus in this paper is sampling 3- 
colorings of the even discrete torus Tl }C i- This is the 
graph on vertex set {0, . . . , L — l} d (with L even) with 
edge set consisting of those pairs of vertices that differ 
on exactly one coordinate and differ by 1 (mod L) on 
that coordinate. For a Markov chain M. on the 3- 
colorings of Tj,d we denote by tm the mixing time of 
the chain; this will be formally defined in Section[5] Our 
main theorem is the following. 

THEOREM 1.1. There is a constant do > for which 
the following holds. For d > do and L > 4 even, the 
Glauber dynamics chain M. 3 on C^{Tt,^) satisfies 

T ^- eXP {^l}' 

Our techniques actually apply to a more general 
class of chains. A Markov chain M. on state space C3 
is p-local if, in each step of the chain, at most p\V\ 
vertices have their color changed; that is, if 

Pm(xi,X2)¥=0 

implies 

\{veV : X i(v)^X2(v)}\<p\V\. 

These types of chains were introduced in [4,, where the 
terminology p\V\ -cautious was employed. We prove the 
following, which easily implies Theorem ll.il 



THEOREM 1.2. Fix p > satisfying H{p) + p < 1. 
There is a constant do = rfo(p) > for which the 
following holds. For d > do and L > 4 even, if M. is an 
ergodic p-local Markov chain on C^{TL,d) with uniform 
stationary distribution then 

f L^ 1 1 
tm > exp < — — 2— V . 

{ d 4 log 2 L J 

Here H(x) — — x\ogx — (1 — x] log(l — x) is the usual 
binary entropy function. Note that all p < .22 satisfy 
H{p) + p<\. 

1.2 Techniques We show slow mixing via a conduc- 
tance argument by identifying a "bad cut" in the state 
space requiring exponential time to cross. Intuitively, in 
sufficiently high dimension, the set of 3-colorings of the 
lattice is believed to naturally partition into 6 classes: 
each class is identified by a predominance of one (of 
3) colors on one of the two (even or odd) sublattices. 
This characterization was recently rigorously verified on 
the infinite lattice, thereby establishing the existence of 
6 distinct "maximal entropy Gibbs states" [9]. That 
work builds heavily on technical machinery introduced 
by Galvin and Kahn [5] showing that independent sets 
partition similarly in sufficiently high dimensions in that 
they lie primarily on the even or odd sublattices. Specif- 
ically, write 8 and O for the sets of even and odd vertices 
of Z d (defined in the obvious way) and set = [-L, L] d 
and dA L = [-L, L] d \ \—{L -1),L- l] d . For A > 0, 
choose I from I(A^) (the set of independent sets of the 
box) with Pr(I = i") oc A^L Galvin and Kahn showed 
that for A > Cd- 1 ^ log 3/4 d (for a large constant C) 
and fixed v £ Al PI £ 

lim F(v G I I I D dA L D £) 

L—too 

> lim P(u G I I I 2 dA L n O) . 

L— >oo 

In other words, the influence of the boundary on the 
center of a large box persists as the boundary recedes. 

Notice that neither of the results of [5] or [5] es- 
tablishing the presence of multiple Gibbs states directly 
implies anything about the behavior of Markov chains 
on finite lattice regions. However, they do suggest that 
in the finite setting, typical configurations fall into the 
distinct classes described in stationarity and that it will 
be unlikely to move between these classes; the remain- 
ing configurations are expected to have negligible weight 
for large lattice regions, even when they are finite. 

Galvin [7] extended the results of [8], showing that 
in sufficiently high dimension, Glauber dynamics on in- 
dependent sets mixes slowly in rectangular regions of 
Z d with periodic boundary conditions. Similar results 



were known previously about independent sets; how- 
ever, one significant new contribution of [7] was showing 
that as d increases, the critical A above which Glauber 
dynamics mixes slowly tends to 0. In particular, there 
is some dimension do such that for all d > do, Glauber 
dynamics will be slow on 1 d when A = 1. This turns 
out to be the crucial new ingredient allowing us to rig- 
orously verify slow mixing for sampling 3-colorings in 
high dimensions, as there turns out to be a close con- 
nection between the independent set model at A = 1 and 
the 3-coloring model. Note that unlike most statistical 
physics models, the 3-coloring problem does not have a 
parameter A that can be tweaked to establish desired 
bounds; this makes the proofs here significantly more 
delicate than the usual slow mixing arguments. Section 
13.11 provides a more detailed discussion of the elements 
of the proof and of some of the difficulties inherent to 
the sampling problem under discussion. 

2 Partitioning the state space 

We begin by formalizing some definitions. Given a 
Markov chain M. on state space f2 with uniform sta- 
tionary distribution denoted by 7T, let P t (X,-) be the 
distribution of the chain at time t given that it started 
in state X. The mixing time tm of A4 is defined to be 

1 



tm = min^o : ||P ,7r|| tv < - V< > t 



where 



Yen 



is the total variation distance. 

We prove Theorem 11.21 via a well-known conduc- 
tance argument [13l [TH [18] , using a form of the argu- 
ment derived in [4]. As above, let M. be an ergodic 
Markov chain on state space with transition proba- 
bilities P and stationary distribution ir. Let A C il 
and M C!l\i satisfy tt(A) < 1/2 and uji e A,w 2 6 
n\(AU M) P(ui,u} 2 ) = 0. Then from g] we have 



(2.1) 



Stt(M) ' 



Let us return to the setup of Theorem ll.2l For even 
L, Ti,,d is bipartite with partition classes £ (consisting 
of those vertices the sum of whose coordinates is even) 
and O. To show torpid mixing, it is sufficient to identify 
a single bad cut. We concentrate on the vertices in 
each 3-coloring that are colored with the first color, 
0. The objective of Theorem 11.21 will be to verify that 
most 3-colorings have an imbalance whereby the vertices 
colored lie predominantly on £ or O, and those that 



are roughly balanced on the two sublattices are highly 
unlikely in stationarity. This is sufficient to show that 
the conductance is small. 

Accordingly let us define the set of "balanced" 3- 
colorings by 

C b 3 '" = { x eC 3 : 11x^(0) n^l-lx-^O) nO|| <pL d /2} 

and likewise let 

Cf p = { x eC 3 : \ x -\0) n£\> |x^(0) n o\ + P L d /2}. 

By symmetry, n 3 (C 3 ' p ) < 1/2. Notice that since Ai 
updates at most pL d vertices in each step, we have 
that if xi G Cf lp and X 2 G C 3 \ (Cf ' p U C b 3 p ) then 
Pm(xuX2) = 0. Therefore, by (231, 



TM > 



Met) ^ i-^(cf' p ) 

-}b,p 



> 



87r 3 (C 3 ^) " IGtts^) 



and so Theorem 11.21 follows from the following critical 
theorem. 

THEOREM 2.1. Fix p > satisfying H(p) + p < 1. 
There is a constant do = do(p) > for which the 
following holds. For d > do and L > 4 even, 



7T3( C 3' P ) ^ ex P 



-2L 



d-l 



d 4 log 2 L 



3 Proof of Theorem [27T1 

3.1 Setup and overview For a generic x G C 3 " 
there are regions of 7l j£ j consisting predominantly of 
even vertices colored together with their neighbors, 
and regions consisting of odd vertices colored together 
with their neighbors. These regions are separated by 
two-layer "0-free" moats or cutsets. In Section 13.21 we 
describe a procedure that selects a particular collection 
of these cutsets. Our main technical result, Lemma 
13. 1 1 asserts that for each specification of cutset sizes 
ci, . . . , ct and vertices v\, . . . ,vi, the probability that a 
coloring has among its associated cutsets a collection 
71, . . . , j£ with — Ci and with Vi surrounded by 7, is 
exponentially small in the sum of the Cj's. This lemma 
is presented in Section 13.31 and Theorem 12.11 is derived 
from it in Section l3Tl 

The main thrust of [9] is the proof of a result that 
is essentially (but not quite) the case I = 1 of Lemma 
13.11 One difficulty we have to overcome in moving 
from a Gibbs measure argument to a torpid mixing 
argument is that of going from bounding the probability 
of a configuration having a single cutset to bounding 
the probability of it having an ensemble of cutsets. 
Another difficulty is that the cutsets we consider in these 
ensembles can be topologically more complex than the 



connected cutsets that are considered in [9]. In part, 
both of these difficulties are dealt with by the machinery 
developed in [7]. 

We use a "Peierl's argument" to prove Lemma |3~T1 
By carefully modifying each \ S C 3 ' p inside its cutsets, 
we can exploit the fact that the cutsets are 0-free to 
map x to a set v?(x) °f many different \' g C 3 . If the 
<£>(x)'s were disjoint for distinct x's, we would essentially 
be done, having shown that there are many more 3- 
colorings in total than 3-colorings in C 3 ' p . To control the 
possible overlap, we define a flow v : C 3 ' p x C 3 — > [0, 00) 
supported on pairs (x, x') with \' € fix) m such a way 
that the flow out of each x G C 3 ' p is 1. Any uniform 
bound we can obtain on the flow into elements of C3 
is then easily seen to be a bound on ■k 3 {C\' p ). We 
define the flow via a notion of approximation modified 
from 8 . To each cutset 7 we associate a set A (7) 
that approximates the interior of 7 in a precise sense, 
in such a way that as we run over all possible 7, the 
total number of approximate sets used is small. Then 
for each \' G C3 and each collection of approximations 
Ax,...,Ai, we consider the set of those x S 
with x' € y(x) and with A4 the approximation to 7^. 
We define the flow so that if this set is large, then 
v {Xi x') is small for each x in the set. In this way we 
control the flow into \' corresponding to each collection 
of approximations A\,...,Af, since the total number 
of approximations is small, we control the total flow 
into x' ■ In the language of statistical physics, this 
approximation scheme is a course- graining argument. 
The details appear in Section 01 

The main results of [7J and [5] are proved along 
similar lines to those described above. One of the dif- 
ficulties we encounter in moving from these arguments 
on independent sets to arguments on colorings is that of 
finding an analogous way of modifying a coloring inside 
a cutset in order to exploit the fact that it is 0-free. The 
beginning of Section[3](in particular Claims rPl and POj) 
describes an appropriate modification that has all the 
properties we desire. 

3.2 Cutsets We describe a way of associating with 
each x G C 3 ' p a collection of minimal edge cutsets, 
following the approaches of [5] and [7J. First we need a 
little notation. 

Write V for the vertex set of and E for its 
edge set. For X C V, write V(X) for the set of edges 
in E that have one end in X and one end outside X; 
X for V \ X; di n tX for the set of vertices in X that 
are adjacent to something outside X; d ex tX for the set 
of vertices outside X that are adjacent to something 
in X; X+ for X U d ext X; X s for X n £ and X° for 
X n O. Further, for x G V set dx = d ex t{x}. We abuse 



notation slightly, identifying sets of vertices of V and 
the subgraphs they induce. 

For each X € C 3 ' p set I = I(x) = X _1 ( )- Note 
that I(x) is an independent set (a set of vertices no 
two of which are adjacent). For each component R 
of (I s ) + or and each component C of R, set 

7 = 7RC (I) = V(C) and W = W RC (I) = C. Evidently 
C is connected, and W consists of R, which is connected, 
together with a number of other components of R, each 
of which is connected and joined to R, so W is connected 
also. It follows that 7 is a minimal edge-cutset in Tl^. 
Say that 7 is even if R is a component of (I £ ) + and 
odd otherwise. Define int 7, the interior of 7, to be the 
smaller of C, W (if \W\ = \C\, take int 7 = W). 

The cutsets 7 associated to x depend only on the 
independent set 7(x)> and coincide exactly with the 
cutsets associated to an independent set in [Jj. We 
may therefore apply the machinery developed in [7J 
for independent set cutsets in the present setting. In 
particular, from [7J Lemmas 3.1 and 3.2] we know that 
for each x € C3 there is a collection of associated cutsets 
r(J) such that either 



for all 7,7' G r(J), 
7, 7' are even with int 7 n int 7' 

'7) 



and I £ C U 7 int ' 



(3.2) 



or we have the analogue of (|3.2j) with even replaced by 
odd. Set Cf en = {x G C3 : x satisfies ([3T2])}. From 
here on whenever x G C™ en is given we assume that 
/ is its associated independent set and that T(I) is a 
particular collection of cutsets associated with x and 
satisfying (|3.2[) . Numerous properties of 7 G T(I) are 
established in [7J Lemmas 3.3 and 3.4]. We list some 
here that will be of use in the sequel. That the cutsets 
are indeed 0-free regions is established by (|3 .4[) . 



(3.3) 



dintW C O and 8UWC£: 



(3.4) <9 int IFn/ = and d ext W D I = 0; 

(3.5) = a ea;t VF £ and W £ = {y G £ : dy C 14^°} ; 

(3.6) for large enough d, | 7 | > maxjll^l 1 - 1 /' 1 , d 19 }. 
3.3 The main lemma For c G N and w G V set 



, 7 G r(/) for some x G Cf 6 ™ 



and set W = U c ,„W(c, w). A profile of a collection 
{71, ...jJCWisa vector p = (ci, «i, . . . , c e , vg) with 
7i G W(cj, Uj) for all i. Given a profile vector p set 



C 3 (p) - x e C 3 e 



r(7) contains a subset 
with profile p 



Our main lemma (c.f. [?] Lemma 3.5]) is the following. 

Lemma 3.1. There are constants c,do>0 such that the 
following holds. For all even L > 4, d > do and profile 
vector p, 



(3.7) 



7T3(C 3 b)) < exp 



= 2 



We will derive Theorem 12.11 from Lemma 13.11 in 
Section l3T4l before proving the lemma in Section|U From 
here on we assume that the conditions of Theorem 12.11 
and Lemma 13.11 are satisfied (with do sufficiently large 
to support our assertions). 

3.4 Proof of Theorem 12.11 assuming the main 
lemma We begin with an easy count that dispenses 
with colorings where \I(x)\ is small. Set 



= {x e C 



b, P 

3 



'!} < L^Ad 1 ' 2 } 



Lemma 3.2. 7r 3 (Cf ma ") < exp {-n(L d )} . 

Proof: For any A C £ and B C O, let comp(/l, B) be 
the number of components in V \ (A U B U d* A U d*B), 
where for T C £ (or O), 

d*T = {x £ d ext T : dx C T} (= {x € V : dx C T}). 

We begin by noting that by £-0 symmetry 



(3.8) 



small I 



< 2 2i a * A i+i 9 *- B i+ com p( A '- B ) 



where the sum is over all pairs A C £, B C O with no 
edges between A and B and satisfying \A\ < Z/ d /4d 1//2 
and \B\ < (p + l/2d 1 / 2 )L d /2. Indeed, once we have 
specified that the set of vertices colored is A U B, 
we have a free choice between 1 and 2 for the color 
at x 6 d*A U d*B, and we also have a free choice 
between the two possible colorings of each component 
of V\ (AUB{Jd*A{Jd*B). 

A key observation is the following. For A and B 
contributing to the sum in (|3.8[) . 



(3.9) 



comp(A, B) < L d /2d. 



To see this, let C be a component of V \ (A U B). If 
C = {v} consists of a single vertex, then (depending on 
the parity of v) we have either dv C A or dv C B and 
so v £ d*A U d*B. Otherwise, let vw be an edge of C 
with v £ £ (and so w £ O). If v has k edges to B and 
u has £ to A, then (since there are no edges from A to 



B) we have (fc - 1) + {I - 1) < 2d - 2 or k + I < 2d. 
(Here we are using that in T^, if uv G E then there 
is a matching between all but one of the neighbors of u 
and v.) Since v has 2d — 1 — k edges to \ (B U {w}) 
and iu has 2d — 1 — I edges to £ \ (AU {v}) we have that 
|C| = 4d - (fc + I) > 2d. From this, §^ follows. 

Inserting f|3.9f) into (|3.8|) and bounding and 
\d*B\ by the maximum values of |A| and \B\ (valid since 
TC£ (or O) satisfies \T\ < \d ext T\, so \d*T\ < \T\) and 
with the remaining inequalities justified below, we have 



\psmal. 
I u 3 



< 2 ^(p+^+i). ^ 



z<L d /4d 1 /2 



L d /2 



E 



L d /2 

i 



j<(p+l/2d 1 /2)Ld/2 

(3.10) < 2"^( p+ ^ + ^ +ff (^T72-) +H ( p+ ^T72)) 

(3.11) < 2^( 1 -°< 1 » 

for sufficiently large d = d(p). In (|3. 10[) we use the 
Chernoff bound £l=f (f ) < 2 H ^ M for /3 < ±; in 
(j3~TT1) we use H(p) + p < 1. Using 2 L "/ 2 < |C 3 |, the 
lemma follows. □ 
We now consider 

Qlarge,even (q} 3 ^ \ Q srn - a ll\ p Qeven 

3 V 3 \ 3 / 3 



By Lemma l3.2l and £-0 symmetry, Theorem 12 . 1 1 reduces 
to bounding (say) 



(3.12) 



7i"3(C 3 ) < exp 



3L d ~ 



d 4 log 2 L 



Let Ql ar 9 e ' even i n t 



be the Set Of Y e C 'ar- 9 e,e«e n 

such that there is a 7 € r(7) with I7I > (wc 
think of such cutsets as being topologically non-trivial 
("nt"); see [7] for an explanation of this) and also let 



Qlarge,even,triv £jlarge,even ^ q 

that 



large, even,nt 
3 



We assert 



(3.13) 



and 



(3.14) 



//ilarqe.even.nt \ 

^3(^3 ) 



< exp 



-n 



L 



d-l 



{ rrlarqe,even,triv \ 

7i"3(C 3 ) < exp 



d-l 



d 4 log 2 L 



this gives (|3.12j) and so completes the proof of Theorem 
12.11 Both (|3.13j) and (|3.14[) are corollaries of Lemma 
13. 1[ and the steps are identical to those that are used to 



bound the measures of "27 
in [7j Section 3.3]. 



non— trivial^ 
arge,even 



and a X\ 



trivial 
arge,even 



With the sum below running over all vectors p of 
the form (c,v) with v € V and c > and with the 

inequalities justified below, we have 

MC 3 ) < >^7r 3 (C 3 (p)) 



(3.15) 



p 



< L 2d exp<j -fi 



L 



d-l 



< exp < — il 



L 



d-l 



giving (|3.13|) . We use Lemma l3~Tl in (|3.15[> . The factor 
of L 2d is for the choices of c and v. 

The verification of (|3.14l) involves finding an i £ 
[f2(logd),0(dlogi)] and a set 1^(1) C r(7) of cutsets 
with the properties that \Ti(I)\ « L d /2 l , | -y ] « 2 4 for 
each 7 e ^(J) and X) 7 er;(/) M ~ The measure 

oi C 3 is tnen at most the product ot a term 

that is exponentially small in L d ~ l (from Lemma T3.ip . 
a term corresponding to the choice of a fixed vertex in 
each of the interiors, and a term corresponding to the 
choice of the collection of lengths. The second term will 
be negligible because I\(J) is small and the third will 
be negligible because all 7 <G r,;(7) have similar lengths. 

More precisely, for \ S c l £ r 9 e ' even ' tr " lv anc } 7 g T(I) 
we have I7I > |int7| 1_1 / d (by (|3.6jl ) and so 

^ | 7 |d/(d-i) > ^ |int 7 | > |/ £ | > L d /4d^ 2 . 
T er(i) T er(i) 



The second inequality is from (|3.2j) and the third follows 
since x £ C| mo ". 

Set Ti(I) = {7 e r(J) : 2 1 - 1 < | 7 | < 2 4 }. Note 
that T^I) is empty for 2' < d 1,9 (again by (|3.6p ) and 
for 2 l_1 > so we may assume that 

(3.16) 1.91ogd< i < {d- l)logi + 1. 

Since 53m=i I/™ 2 = 7I " 2 /6, there is an i such that 

L d 



(3.17) J] | 7 |^>^ 

7er,(/) 



d l/2j2 



Choose the smallest such i set £ = (X) | . We have 
S 7 er ■(/) ItI — ^(^2 4 ) (this follows from the fact that 
each 7 g ri(I) satisfies I7I > 2 4_1 ) and 



C-j.iS) (^-)>£>n(—^ 



The first inequality follows from that fact that ^ \j\ < 
dL d = \E\; the second follows from fl3.17[) and the fact 
that each 7 has \ 1 \ d /( d - 1 ) < 2 dz /^ 1 l We therefore 



have x € C 3 (p) for some p = (ci, v±, ■ . . , ty, vg) with £ 
satisfying (|3.18j) . with 



(3.19) 

with 
(3.20) 



^ Cj >0(^2 1 ), 



Cj < T 



for each j and with i satisfying (|3.16[) . With the sum 
below running over all profile vectors p satisfying p,16[) . 
f3~T8l) . (l3~T9l) and (j3~20l) we have 



(3.21) tt 3 (C 



large, even, triv 



) < ^7T3(C 3 (p)). 



The right-hand side of (|3.21| is, by Lemma [3~Tl at most 

-fL d \ ( ( £2 i 

dlogL max 2 ll [ exp < —CI — 

i satisfying (l3~16l) V * / I V d 

The factor of d log L is an upper bound on the number of 
choices for i; the factor of 2 l% is for the choice of the Cj's; 

and the factor ( L £ ) is for the choice of the £ (distinct) 
Vj's. By (|3.16[) and the second inequality in (|3.18[) we 
have (for c? sufficiently large) 



< 2 u (o(2^i 2 d 1 



/2 



< 2 



4ft 



exp < o 



so that in fact the right-hand side of (13.211) is at most 



2 l £ 



dlogL max exp <^ —57 ( —j- 



Taking £ as small as possible we see that this is at most 

2 i L d 



dlogL max exp < — Q 



and taking i as large as possible we see that this is at 
most exp{— 4L d_1 /d 4 log 2 L}. Putting these observa- 
tion together we obtain (|3.14p . 

4 Proof of Lemma 13.11 

Much of what follows is modified from [7] and [|5]. Our 

strategy is as follows. Let p — (c\, Vx, . . . , ct, Vi) be 
given. Set p' — (02, v<i, . . . , ci, vi). We will show 



(4.22) 



^3(^3 (p)) 



from which the lemma follows by a telescoping product. 
To obtain (|4.22|) we define a one-to-many map ip from 
C$(p) to C 3 (p'). We then define a flow v : C 3 (p) x 
Cz{p') — > [0,oo) supported on pairs (x>x') wrt h x' € 
fix) satisfying 

(4.23) v x ec 3 (p), Yl Kx,x') = i 

x'ev(x) 

and 
(4.24) 

Vx'eW), £ K X , X ') <exp{-0(^-)}. 

X6V~ 1 (x') 

This easily gives (|4~22")) . 

For each s £ {±1 5 ■ • ■ , define <r s , the s/ii/f in 
direction s, by cr s (ir) = x + e s , where e s is the sth 
standard basis vector if s > and e s = — e_ s if s < 0. 
ForXCV write cr s (A) for {cr s (a;) : x € X}. For 7 € W 
set f s ={i£ cUW : a_ s (a;) £ W}. 

Let x S Ca(p) be given. Arbitrarily pick 7 € 
r(J) n W(ci,ui) and set = hit 7. Write / for the 
map from {0,1,2} to {0,1,2} that sends to and 
transposes 1 and 2. For each s £ {±l,...,±d} and 
SCW define the function Xs ■ V {0, 1, 2} by 

[ if « € 5 

Xs(«) = i X(v) i£ve(W s \S)U(V\W) 
{ f(x(*- s (v))) iiveW\W s 

and set <p s ( X ) = { X s s ■ S C W s }. 

Claim 4.1. ^ s ( x ) CC 3 (^). 

Proof: An easy case analysis verifies <^ s (x) C C 3 . 
We begin with the observation that the graph di nt W U 
d ext W is bipartite with bipartition (d int W, d ext W). 
This follows from (J33J). By ln{d mt WUd ext W) = 

and so for each component U of di nt W U d ext W, x is 
constant on UOdi n tW and on UDd ex tW and in neither 
case does it take on the value 0. 

Fix S C VF S . We show that if {u,i>} is an edge of 
Tlm then x|( u ) 7^ Xs( w )- We consider five cases. 

If u,v £ W then XsW = xW and x|( w ) = x( v )- 
But x(u) 7^ X(w). so Xs( u ) 7^ Xs( w ) in this case. 

If u £ W and v £ W then Xs( v ) = x( v ) and 
Xs( u ) ^ {0:X( U )} ( we w iH justify this in a moment). 
Since v € d ex tW we have x( v ) and we cannot ever 
have x( v ) — x( M ): so Xs( u ) Xs( v ) m tms case. To see 
that Xs( u ) e {0jX( u )}i we consider subcases. If u G 5 
then xM u ) = °- If m € W s \ 5 then Xs( u ) = 
Finally, if u € W \ W s then xj( u ) = f(xl^-s(u))); 
and /(x(c-s(w))) is either or x( u ) depending on 
whether x(cr_ s (u)) equals or x( v ) (xi a -s(u)) cannot 
equal x ("))• 



H u,v £ W\W S then x"s(u) = / (x(cr_ a (tt))) and 
Xs( v ) — /(x( a -s(w)))- Since / is a bijection and 
x(o-- s (u)) ^ x(o--s(w)) we have x s s (u) ^ Xs( w ) in tllis 
case. 

If u £ W \ W s and v € W s \ S then X s s ( u ) e 
{0,x(u)} (as in the second case above) and Xs( v ) = 
x(w). Since x(w) 7^ 0, we have Xs( u ) ^ Xs( v )- 

Noting that it is not possible to have both u, v € 
W^ s , we finally treat the case where u £ W \ W s and 
v e S. In this case Xsi v ) — x{ v ) = 0- Suppose (for a 
contradiction) that Xs( u ) — 0- This can only happen if 
x(cr_ s (w)) = 0. If a- s (u) = v, we have a contradiction 
immediately. Otherwise, we have a^ s (v) £ W and 
so (since ct_ s (u)(t_ s (w) e E) cr_ s (u) e di nt W, also a 
contradiction. 

This verifies tp s (x) £ C3. Because int7 is disjoint 
from the interiors of the remaining cutsets in r(/) and 
the operation that creates the elements of f s (x) on ly 
modifies X inside W it follows that f s {x) ^Cs(p'). □ 

Claim 4.2. Given x' € fs{x)> X can be uniquely 
reconstructed from W and s. 

Proof: Following [5], we may reconstruct x as follows. 

/ x» ifveV\W 
x(v) ~ \ f(x'(v s (v))) itveW. 

□ 

We define the one-to-many map ip from C 3 {p) to 
Ca(p') by setting ^(x) = <Ps{x) f° r a particular direction 
,s. To define v and s, we employ the notion of 
approximation also used in [5] and based on ideas 
introduced by Sapozhenko in |16j . For 7 € W, we say 
A C V is an approximation of 7 if 

A £ D I¥ e and A° C 

d A o (x) >2d-Vd for all x£# 

and 

ck\A* (x) >2d-Vd for all y e O \ A° , 

where dx(a?) = |9xnX|. Note that from and (|3"3j) . 
1^(7) is an approximation of 7. 

Before stating our main approximation lemma, 
which is a slight modification of [51 Lemma 2.18], it will 
be convenient to further refine our partition of cutsets. 
To this end set 

( 7 € r(7) for some X G Cf en ] 
W(w e ,w ,v) = ij: with \W°\ = w , \W £ \ = w e \. 

[ and v e W £ J 

Note that by ^33} we have | 7 | = 2d(\W°\ - \W £ \) so 
W{w e ,w , v) C W((w - w e )/2d, v). 



Lemma 4.1. For each w e , w and v there is a family 
A(w e ,w 0l v) satisfying 

\A(w e ,w ,v)\ < exp |(9 (^(w - w e )d~^ log^ d^j j 

and a map ir : W(w e ,w ,v) — » A(w e ,w ,v) such that 
for each 7 G W(w e ,w ,v) , 77(7) is an approximation 
for 7. 

Proof: See [3 Lemma 4.2]. □ 
We are now in a position to define v and s. Our 
plan for each fixed x' G Ca(p') is to fix w e ,w and 
A G W(w e ,Wo,w) and to consider the contribution to 
the sum in (|4.24[) from those x G ( P~ 1 (x') with 77(7) = A 
(where for each x, 7 is a particular 7 G L(/)nW(ci, vi)). 
We will try to define v in such a way that each of these 
individual contributions to (|4.24[) is small; to succeed in 
this endeavor we must first choose s with care. To this 
end, given 7 G W(w e , w ,v), set 



Q t = A t nd ext (0\A u ) and Q u = (0\A u )nd ext A t , 

where A = 77(7) in the map guaranteed by Lemma |4. II 
To motivate the introduction of Q and Q° , note that 
for 7 G tt-^A) we have (by ([32]) and (133]) ) 

A 8 \ Q £ C VF £ , 
A° C 

and 

0\(i c, UQ c, )CO\W° 

It follows that for each 7 G 7r _1 (A), Q £ UQ° contains all 
vertices whose location in the partition TL,d — WUW is 
as yet unknown. We choose s(x) to be the smallest s for 
which both of |V^ S | > .8(w -w e ) and \a s (Q £ ) n Q°\ < 
5\W s \/y/d hold. This is the direction that minimizes 
the uncertainty to be resolved when we attempt to 
reconstruct x from the partial information provided 
by x' G f~ 1 (x)i s an d A. (That such an s exists is 
established in [SJ (49) and (50)] by an easy averaging 
argument). Note that s depends on 7 but not /. 

Now for each x G C 3 (p) l°t 7 G r(7) be a particular 
cutset with 7 G W(ci,ui). Let (p(x) be as defined 
before, with s as specified above. Define 



C = W s C\A° C\a s {Q £ ) 



and 



D = W S \C, 
and for each x' G <p{x) se t 



^ |cn/( x ')| ^ |c\/( x ')l \d\ 



Note that for x G KXjX') depends on W but 

not on x itself. 

Since C U D partitions W we easily have (I4.23[) . To 
obtain (|4.22l) and so ([3~7]) we must establish (|4.24l) . 

Fix iu e , w such that 2d(w — w e ) — c\. Fix 
A G -4.(iD e) w^ot Vi) an d s G {±1, . . . , ±d}. For x with 
7 G W(w e , w , v\) write x ~ s A if it holds that 71(7) = A 
and s(x) = s. We claim that with A, s, w and w e fixed, 
for x' G C 3 (p0 
(4.25) 



We could extract this directly from [5], but for the 
convenience of the reader we describe a proof below. 

Write C 3 (p)(w e ,w ,s,A,x') for the set of all x G 
C 3 (p) such that W G W(w e , w ,wi), 77(7) = A, s(x) = s 
and x' G </j(x) and set U = Q £ n <r_ s (x'). Say that a 
triple (K, L, M) is (700c! for x if it satisfies the following 
conditions. 

K U L U M is a minimal vertex cover of Q £ U Q , 
# G Q°, L CU and M CQ £ \U 



and 



K = d ext (U\L). 



We begin by establishing that x G Cs(p)(w e , ui , s, A, x') 
always has a good triple. 

Lemma 4.2. For each x G C 3 (p)(w e , iu , s, A, x') i/ie 
triple 

(K, L, M) := (W HQ°,U\ W, {Q £ \U)\W) 
is good for \. 

Proof: [8] around discussion of (54)]. □ 
In view of Lemma 14.21 there is a triple (K,L,M) 
that is good for x an d which has + |L| as small as 
possible. Choose one such, say (Ko(x) 7 L (x),M (x))- 
Set K'( X ) — Kq \ K and L'( X ) = L \L Lemma S3] 
below establishes an upper bound on v(x, x') i n terms of 
|-Ko|, \Lo\, \K'\ and |L'|, and Lemma WM shows that for 
each choice of K', L' there is at most one x contributing 
to the sum in the lemma. These two lemmas combine 
to give (|4.25|) . 

Lemma 4.3. For each x G C 3 (p)(w e , w a , s, A, x'), 




2\«o\ 



I 2 I ^\k \+\l \ 2 \k'\-\l'\ 
= B(K', L'). 



Proof: We follow [U from just before (55) to just after 
(60)], making superficial changes of notation. □ 
The inequality in Lemma 14.31 is the 3-coloring ana- 
logue of the main inequality of [5] . The key observation 
that makes this inequality useful is the following. 

Lemma 4.4. For each w e) w Q , s, A, \' > K' an d L' } 
there is at most one \ with x G Cs(p)(w e , w a , s, A, x')> 
K' = K'(x) and V = L'{x). 

Proof: In [8, (56) and following] it is shown that K' and 
L' determine W° via 

K=(K \ K') U {d ext L' n Q°) 

and so W (via W £ = {v E £ : dv C W°}). But then 
by Claim 1431 K ' and L' determine %• D 
Lemmas 14.31 andl 4.4l together now easily give (|4.25|l : 



E 

xeC 3 (p)(w e ,W ,S,A,x') 



v(x,x') < E b{k\l') 



K'CKn, L'CLn 



< 




We have now almost reached (|4.24l) . With the steps 
justified below we have that for each x' & C^{p r ) 

E KX,X') < E iKx,x'): ^Ax') 
xe<p- 1 (x') 

(4.26) 



(4.27) 
(4.28) 



< 2dc^ 1 \A{w e ,w ,v 1 )\ 

< 2dcf 1 exp{-fi(ci/d)} 

< exp{-fi(ci/d)}, 



completing the proof of (|4.24l) . In the first inequality, 
J2' is over all choices of w e , w , s and A. In (|4.26[) , 
we note that there are \A(w e ,w ,Vi)\ choices for A, 2d 
choices for s and cf^ d 1 ' choices for each of w e , w (this 
is because c\ > (w e +w ) 1 ~ 1 / d , by <[3T6]) ) , and we apply 
(|4.25l) to bound the summand. In (I4.27|) we use Lemma 
O Finally in (g^gj) we use ci > d 19 (again by ([3"lj])) 



to bound 2(^0]^ 
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